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SEQIENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 

(A) NAME: The University of Leicester 

(B) STREET: University 

(C) CITY: Leicester 

(E) COUNTRY: United Kir 

(F) POSTAL CODE (ZIP) : 



(ii) TITLE OF INVENTION: Com] 



Road 

gdom 
LEI 7RH 



lement Inhibitor 



(iii) NUMBER OF SEQUENCES: 14 



(iv) COMPUTER READABLE FORM 

(A) MEDIUM TYPE: Floppi" disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 



(EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTIC 

(A) LENGTH: 4229 base pSairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doubll 

(D) TOPOLOGY : unknown 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 



TCGAGTCAAC TGCTCCCAGA TAG ATC CAAG ACATGAGACT GTCAGCAAGA ATTATTTGGC 
TTATATTATG GACTGTTTGT GTAGCAGAAG ATTGTAAAGG TCCTCCTCCA AGAGAAAATT 
CAGAAATTCT CTCAGGTTCG TGGTCTGAAC AflCTATATTC AGAAGGCACT CAGGCAACCT 
ACAAATGCCG CCCTGGATAC CGAACACTTG GTACTATTGT AAAAGTATGC AAGAATGGAG 
AATGGGTACC TTCTAACCCA TCAAGGATAT GTGGGAAAAG GCCATGTGGG CATCCCGGAG 
ACACACCCTT TGGGTCCTTT AGGCTGGCAG TTGGATCTGA ATTTGAATTT GGTGCAAAGG 
TTGTTTATAC ATGTGATGAA GGGTACCAAC TATTIAGGTGA AATTGATTAC CGTGAATGTG 
ATGCAGATGG GTGGAC GAAT GATATTC CAA TATGTGAAGT TGTGAAGTGC TTGCCAGTGA 
CAGAACTGGA GAATGGAAGA ATTGTGAGTG GTGCftGCCGA ACCAGACCAG GAATATTATT 
TTGGACAGGT GGTACGCTTT GAATGCAACT CCGGCTTCAA GATTGAAGGA CAGAAAGAAA 



.TGCACTGCTC AT AAAATGG C CTCTGGAGCA ATG, 



lGCC ACAGTGTGTG GAAATTTCTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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GCCTGCCACC 


ACGAGTTGAA 


AATGGAGATG 


GTATATATCT 


GAAACCAGTT 


TACAAGGAGA 


720 


ATGAAAGATT 


CCAATATAAA 


TGTAAGCAAG 


GTTTTGTGTA 


CAAAGAAAGA 


GGGGATGCTG 


780 


TCTGCACGGG 


TTCTGGATGG 


AATCCTCAGC 


CTTCCTGTGA 


AGAAATGACA 


TGTTTGACTC 


840 


CATATATTCC 


AAATGGTATC 


TACACACCTC 


ACAGGATTAA 


ACACAGAATT 


GATGATGAAA 


900 


TCAGATATGA 


ATGTAAAAAT 


GGCTTCTATC 


CTGCAACCCG 


ATCACCTGTT 


TCAAAGTGTA 


960 


CAATTACTGG 


CTGGATCCCT 


GCTC CAAGAT 


GTAGCTTGAA 


ACCTTGTGAT 


TTTCCACAAT 


1020 


TCAAACATGG 


ACGTCTGTAT 


TATGAAGAAA 


GCCGGAGACC 


CTACTTCCCA 


GTACCTATAG 


1080 


GAAAGGAGTA 


CAGCTATAAC 


TGTGACAACG 


GGTTTACAAC 


GCCTTCACAG 


TCATACTGGG 


1140 


ACTACCTTCG 


TTG CACAGTA 


AATG GGTGGG 


AGCCTGAAGT 


TCCATGCCTC 


AGGCAATGTA 


1200 


TTTTC CATTA 


TGTGGAATAT 


GGAGAATCTT 


CATACTGGCA 


AAGAAGATAT 


ATAGAGGGTC 


1260 


AGTCTGCAAA 


AGTCCAGTGT 


CACAGTGGCT 


ATAGTCTTCC 


AAATGGTCAA 


GATACATATT 


1320 


ATTGTACAGA 


GAATGGCTGG 


TCCCCTCCTC 


CCAAATGCGT 


CCGTATCAAG 


ACTTGTTCAG 


1380 


TATCAGATAT 


AGAAATTGAA 


AATG GGTTTT 


TTTCTGAATC 


TGATTATACA 


TATG CTCTAA 


1440 


ATAGAAAAAC 


ACGGTATAGA 


TGTAAACAGG 


GATATGTAAC 


AAATACCGGA 


GAAATATCAG 


1500 


GAATAATTAC 


TTGTCTTCAA 


GATGGATGGT 


CACCTCGACC 


CTCATGCATT 


AAGT CTTGTG 


1560 


ATATGCCTGT 


ATTTGAGAAT 


TCTATGACTA 


AGAATAATAA 


CACATGGTTT 


AAACTCAATG 


1620 


ACAAATTAGA 


CTATGAATGT 


C ACATTG GAT 


ATGAAAATGA 


ATATAAACAT 


ACCAAAGGCT 


1680 


CTATAACATG 


TACTTATGAT 


GGATGGTCTA 


GTACACCCTC 


CTGTTATGAA 


AGAGAATGCA 


1740 


GCATTCCCCT 


GTTACACCAA 


GACTTAGTTG 


TTTTTCCCAG 


AGAAGTAAAA 


TACAAAGTTG 


1800 


GAGATTCGTT 


GAGTTTCTCT 


TGCCGTTCAG 


GACACAGAGT 


TGGAGCAGAT 


TTAGTGCAAT 


1860 


GCTACCACTT 


TGGATGGTCC 


CCTAATTTCC 


CAACGTGTGA 


AGGCCAAGTA 


AAAT CATGTG 


1920 


ACCAACCTCT 


TGAAATCCCG 


AATGGGGAAA 


TAAAGGGAAC 


AAAAAAAGTT 


GAATACAGCC 


1980 


ATGGTGACGT 


GGTGGAATAT 


GATTGCAAAC 


CTAGATTTCT 


ACTGAAGGGA 


CCCAATAAAA 


2040 


TCCAGTGTGT 


TGACGGGAAG 


TGGACAAGGT 


TGCCGATATG 


CGTTGAGTAT 


GAGAGAACAT 


2100 


GTGGAGACCT 


TCCTGAACTT 


GAGCATGGCT 


CTGTCAAGTT 


ATCTGTCCCT 


CCCTACCATC 


2160 


ATGGAGATTC 


AGTGGAGTTC 


ACTTGTACAG 


AAAC CTTCAC 


AATGATTGGA 


CATGCAGTAG 


2220 


TTTTCTGCAT 


TAGTGGAAGG 


TGGACCGAGC 


TTCCTCAATG 


TGTTGCAACA 


GATCAACTGG 


2280 


AGAAGTGTAA 


AGCCCCGAAG 


TCAACTGGCA 


TAGATGCAAT 


TCATCCAAAT 


AAGAATGAAT 


2340 
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TTAATCATAA 


CTTTAGTGTG 


AGTTACAGAT 


GTAGACAAAA 


GCAGGAGTAT 


GAACATTCAA 


24 00 


TCTGCATCAA 


TGGAAGATGG 


GATCCTGAAC 


CAAACTGTAC 


AAGCAAAAGA 


TTCTGCCCTC 


2460 


CTCCCCCGCA 


GATTCCAAAT 


GCC CAAGTGA 


TTGAAACCAC 


CGTGAAATAC 


TTGGATGGAG 


2520 


AAAAAGTATC 


TGTTCTTTGC 


CAAGATGGTT 


AC CTAACTCA 


GGGCCCAGAA 


GAAATGGTGT 


2580 


GTAAACATGG 


AAGGTGG CAG 


TCGTTACCAC 


GCTGCACGGA 


AAAAATTC C A 


TGTTCCCAGC 


2640 


CCCCTAAAAT 


TGAACATGGA 


TCTATTAAGT 


CGCCCAGGTC 


CTCAGAAGAG 


AGGAGAGATT 


2700 


TAATTGAGTC 


CAGCAGTTAT 


GAACACGGAA 


CTACATTCAG 


CTATTGCTGT 


AGAGATGGAT 


2760 


TCAAGATATC 


TGAAGAAAAT 


AGGGTAACCT 


GCAACATGGG 


AAAATGGAGC 


TCTCTGCCTC 


2820 


GTTGTGTTGG 


AATAC CTTGT 


GGACCCCCAC 


CTTCAATTCC 


TCTTGGTATT 


GTTTCTCATG 


2880 


AACTAGAAAG 


TTACCAATAT 


GGAGAGGAGG 


TTACATACAA 


TTGTTCTGAA 


GGCTTTGGAA 


2940 


TTGATGGACC 


AGCATTTATT 


AAATGTGTAG 


GAGGACAGTG 


GTCTGAACCT 


C C CAAATGC A 


3000 


TAAAAACTGA 


TTGTGACAAC 


TTGCCCACAT 


TTGAAATTGC 


CAAACCGACA 


GAAAAGAAAA 


3060 


AAAAATCATA 


CAGGTCAGGA 


GAACAAGTGA 


CATTCAGATG 


TCCACCTCCG 


TATCGAATGG 


3120 


ATGGCTCTGA 


CATTGTCACA 


TGTGTTAATA 


CGAAGTGGAT 


TGGACAGCCG 


GTATGCAAAG 


3180 


ATAATTCCTG 


TGTGAATCCA 


CCACATGTGC 


CAAATGCTAC 


TATACTAACA 


AGGCACAAGA 


3240 


CTAAATATCC 


ATCTGGTGAC 


AAAGTACGTT 


ATGACTGTAA 


TAAACCTTTT 


GAATTATTTG 


3300 


GGGAAGTGGA 


AGTGATGTGC 


CAAAACGGGA 


TTTGGACAGA 


AC C AC CGAAA 


TG CAAAGATT 


3360 


CAACAGGGAA 


ATGTGGGCCT 


CCTCCACCTA 


TTGACAATGG 


AGACAT CACC 


T C CTTGT CAT 


3420 


TACCAGTATA 


TGCACCATTA 


TCATCAGTTG 


AATATCAATG 


C C AG AACTAT 


TATCTACTTA 


3480 


AGGGAAATAA 


GATAGTAACA 


TGTAGAAATG 


GAAAGTGGTC 


TCAGCCACCA 


AC CTGCTTAC 


3540 


ATGCATGTGT 


GAT AC C AGAA 


GATATTATGG 


AAAAACATAA 


TATAGTTCTC 


AGATGGAGGG 


3600 


AAAATGCAAA 


GATTTATTCC 


CAATCAGGGG 


AGAATATTGA 


ATTCATGTGT 


AAACCTGGAT 


^ r** r\ 

3660 


ATAGAAAATT 


CAGAGGATCA 


CCTCCGTTTC 


GTACAAAGTG 


CATTGAGGGT 


CACATCAATT 


3720 


ATCCCACTTG 


TGTATAAAAT 


CGCTATACAA 


TTATTAGTAA 


AC CTTATGGA 


TGAGAAATGC 


3780 


ACATGTATAT 


TACTAATACA 


G TTTG AATTT 


ACATTTAAAT 


ATTGTTTAGC 


TCATTTCCTC 


3840 


TAATAAGTAT 


ATAAACTTTT 


TTTATATGGT 


GGTTAATCAG 


TAACTTTACA 


GACTGTTGCC 


3900 


ACAAAGCAAG 


AACATTACAT 


TCAAAACTCC 


T AATC CAAAT 


ATGATATGTC 


CAAGGACAAA 


3960 


CTATGTCTAA 


GCAAGAAAAT 


AAATGTTAGT 


TCTTCAATGT 


CTGTTTTTAT 


TCAGGACCTT 


4020 
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TCAGATTTTC TTGGATACCT TTTGTTAGGT TCTGATTCAC AGTGAGTGGA AGACACACTG 
ACTCTGACTT CAAATTAGTA TTACTTGCAA TACATTAACA ACCAAACTAT CATAATATCA 
CAAATGTATA CAGCTAATTA CTGTGTCCTA CCTTTGTATC AATAAAGAAA TCTAAGAAAG 
TTCTTGCTTA AAAAAAAAAA AAAAAAAAA 
(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
( D ) TOPOLOGY : unknown 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown- 



4080 
4140 
4200 
4229 







TAGATCCAAG 


ACATGAGACT 


GTCAGCAAGA 


ATTATTTGGC 


60 


TTATATTATG 


GACTGTTTGT 


GTAGCAGAAG 


ATTGTAAAGG 


TCCTCCTCCA 


AGAGAAAATT 


120 


CAGAAATTCT 


CTCAGGTTCG 


TGGTCTGAAC 


AACTATATTC 


AGAAGGCACT 


CAGGCAACCT 


180 


ACAAATGCCG 


CCCTGGATAC 


CGAACACTTG 


GTACTATTGT 


AAAAGTATGC 


AAGAATGGAG 


240 


AATGGGTACC 


TTCTAACCCA 


TCAAGGATAT 


GTCGGAAAAG 


GCCATGTGGG 


CATCCCGGAG 


300 


ACACACCCTT 


TGGGTCCTTT 


AGGCTGGCAG 


TTGGATCTGA 


ATTTGAATTT 


GGTGCAAAGG 


360 


TTGTTTATAC 


ATGTGATGAA 


GGGTACCAAC 


TATTAGGTGA 


AATTGATTAC 


CGTGAATGTG 


420 


ATGCAGATGG 


GTGGACCAAT 


GAT ATT C CAA 


TATGTGAAGT 


TGTGAAGTGC 


TTGCCAGTGA 


480 


CAGAACTGGA 


GAATGGAAGA 


ATTGTGAGTG 


GTGCAGCCGA 


AC C AG AC C AG 


GAATATTATT 


540 


TTGGACAGGT 


GGTACGCTTT 


GAATG CAACT 


C CGGCTTC AA 


GATTGAAGGA 


CAGAAAGAAA 


600 


TGCACTGCTC 


ATAAAATGGC 


CTCTGGAGCA 


ATGAAAAGCC 


ACAGTGTGTG 


GAAATTTCTT 


660 


GCCTGCCACC 


ACGAGTTGAA 


AATGGAGATG 


GATATAGAAA 


ATTCAGAGGA 


TCACCTCCGT 


720 


TTCGTACAAA 


GTG CATTG AG 


GGTCACATCA 


ATTATCCCAC 


TTGTGTATAA 


AATCGCTATA 


780 


CAATTATTAG 


TAAAC CTTAT 


GGATGACACT 


TTGTTTAGAA 


ATGCACATGT 


ATATTACTAA 


840 


TACAGTTTGA 


ATTTACATTT 


GAAAAA 








866 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

TCGAGTCAAC TGCTCCCAGA TAGATCCAAG ACATGAGACT GTCAGCAAGA ATTATTTGGC 60 

TTATATTATG GACTGTTTGT GTAGCAGAAG ATTGTAAAGG TCCTCCTCCA AGAGAAAATT 120 

CAGAAATTCT CTCAGGTTCG TGGTCTGAAC AACTATATTC AGAAGGCACT CAGGCAACCT 180 

ACAAATGCCG CCCTGGATAC CGAACACTTG GTACTATTGT AAAAGTATGC AAGAATGGAG 24 0 

AATGGGTACC TTCTAACCCA TCAAGGATAT GTCGGAAAAG GCCATGTGGG CATCCCGGAG 300 

ACACACCCTT TGGGTCCTTT AGG CTGGCAG TTGGATCTGA ATTTGAATTT GGTG CAAAGG 360 

TTGTTTATAC ATGTGATGAA GGGTACCAAC TATTAGGTGA AATTGATTAC CGTGAATGTG 420 

ATGCAGATGG GTGGACCAAT GATATTCCAA TATGTGAAGT TGTGAAGTGC TTGC CAGTGA 480 

CAGAACTGGA GAATGGAAGA ATTGTGAGTG GTGCAGCCGA ACCAGAC CAG GAATATTATT 540 

TTGGACAGGT GGTACGCTTT GAATGCAACT CCGGCTTCAA GATTGAAGGA CAGAAAGAAA 600 

TGCACTGCTC ATAAAATGGC CTCTGGAGCA ATGAAAAGCC ACAGTGTGTG TTGAAACCTT 660 

GTGATTTTCC ACAATTCAAA CATGGACGTC TGTATTATGA AGAAAG CCGG AGACCCTACT 720 

TCCCAGTACC TATAGGAAAG GAGTACAGCT ATAACTGTGA CAACGGGTTT ACAACGCCTT 78 0 

CACAGTCATA CTGGGACTAC CTTCGTTGCA CAGTAAATGG GTGGGAGCCT GAAGTTCCAT ~ 84 0 

GCCTCAGGCA ATGTATTTTC CATTATGTGG AATATGGAGA ATCTTCATAC TGGCAAAGAA 900 

GATATATAGA GGGTCAGTCT GCAAAAGTCC AGTGTCACAG TGGCTATAGT CTTCCAAATG 96 0 

GTCAAGATAC ATATTATTGT ACAGAGAATG GCTGGTCCCC TCCTCCCAAA TGCGTCCGTA 102 0 

TCAAGACTTG TTCAGTATCA GATATAGAAA TTGAAAATGG GTTTTTTTCT GAAT CTGATT 108 0 

ATACATATGC TCTAAATAGA AAAACACGGT ATAGATGTAA ACAGGGATAT GTAACAAATA 114 0 

CCGGAGAAAT ATCAGGAATA ATTACTTGTC TTCAAGATGG ATGGTCACCT CGACCCTCAT 12 00 

GCATTAAGTC TTGTGATATG CCTGTATTTG AGAATTCTAT GACTAAGAAT AATAACACAT 126 0 

GGTTTAAACT CAATGACAAA TTAGACTATG AATGT CAC AT TGGATATGAA AATGAATATA 13 2 0 

AACATACCAA AGGCTCTATA ACATGTACTT ATGATGGATG GTCTAGTACA CCCTCCTGTT 13 8 0 

ATGAAAGAGA ATGCAGCATT CCCCTGTTAC AC CAAG ACTT AGTTGTTTTT CCCAGAGAAG 144 0 

TAAAATACAA AGTTGGAGAT TCGTTGAGTT TCTCTTGCCG TTCAGGACAC AGAGTTGGAG 1500 

CAGATTTAGT GCAATG CTAC CACTTTGGAT GGTCCCCTAA TTTCCCAACG TGTGAAGGCC 156 0 

AAGTAAAATC ATGTGACCAA CCTCTTGAAA TCCCGAATGG GGAAATAAAG GGAACAAAAA 1620 
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AAGTTGAATA 


CAGCCATGGT 


GACGTGGTGG 


AATAxCjAI xvj 


P A 71 A PPTAGA 


TTT CTACTGA 

A ah •> «B> -f »^B» *»» ^■T«> ^ 


1680 


AGGGACCCAA 


TAAAATCCAG 


TGTGTTGACG 


GGAAGTUGAL 




ATATGCGTTG 


1740 


AGTATGAGAG 


AACATGTGGA 


GACCTTCCTG 


tv TV r»rprnn tv O ^ 7A 

AACTTGACa L. A 


X UV3L X V— X VJ ± V— 


AAGTTATCTG 


1800 


TCCCTCCCTA 


CCATCATGGA 


GATTCAGTGG 


AGTTCALTIb 


*t a pa n A A A CP 


TT CA CAATGA 


1860 


TTGGACATGC 


AGTAGTTTTC 


TGCATTAGTG 


GAAGGTGGAL. 


K~\J+i\j{- X X V— V_ X 


PAATGTGTTG 


1920 


CAACAGATCA 


ACTGGAGAAG 


TGTAAAGCCC 


CGAAGTCAAC 


T* 7A TP A n A T* 




19B0 


CAAATAAGAA 


TGAATTTAAT 


CATAACTTTA 


GTGTGAGTTA 


C AG A lul At? A 




2040 


AGTATGAACA 


TTCAATCTGC 


ATCAATGGAA 


GATGGGATCC 


W^"* 7\ TV f**f^ TV TV TV 

TGAAC LAAAL 




2100 


AAAGATTCTG 


CCCTCCTCCC 


CCGCAGATTC 


CAAATGCCCA 


AGTGATTGAA 


7\ r*P a P Pf3*Tf3 A 


2160 


AATACTTGGA 


TGGAGAAAAA 


GTATCTGTTC 


TTTGCCAAGA 


i""*^*^*"/ m^lV Ti 

TGGTTACCTA 


X i-iwVjvjV*- v^ 


222 0 


CAGAAGAAAT 


GGTGTGTAAA 


CATGGAAGGT 


GGCAGTCGTT 


ACCACGC x Gt_ 


tv r*nn a A A A A A 


228 0 


TTCCATGTTC 


CCAGCCCCCT 


AAAATTGAAC 


ATGGATCTAT 


TAAGT CGC C_ C 


7A rtriTP PTP Afi 
/-Vvjvji iv»v,l v»nV3 


2340 


AAGAGAGGAG 


AGATTTAATT 


GAGTCCAGCA 


GTTATGAACA 


^^/^/^ TV TV /^T 1 A TV 

C GGAAC x ALA 


TTP A^SPTATT 


2400 


GCTGTAGAGA 


T GG ATT C AAG 


Tv T"<TV T 1 f«»TV* TV 7\ f^** 

ATA1 U X VjAAvj 


7a a a a TT^aaa^r 


AAC CTGCAAC 


ATGGGAAAAT 


2460 


GGAGCTCTCT 


GCCTCGTTGT 


GTTGGAATAC 


CTTGTGGACC 


CCCACCTTCA 


ATTCCTCTTG 


2520 


GTATTGTTTC 


TCATGAACTA 


GAAAGTTACC 


AATATGGAGA 


GGAGGTTACA. 


TACAATTGTT 


2580 


CTGAAGGCTT 


TGGAATTGAT 


GGACCAGCAT 


TTATTAAATG 


TGTAGGAGGA 


CAGTGGTCTG 


2640 


AACCTCCCAA 


ATGCATAAAA 


ACTGATTGTG 


ACAACTTGCC 


CACATTTGAA 


ATTGCCAAAC 


2700 


CGACAGAAAA 


GAAAA 










2715 



( 2 ) INFORMATION FOR SEQ ID NO: 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1532 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 
TCGAGTCAAC TGCTCCCAGA TAGATCCAAG ACATGAGACT GTCAGCAAGA ATTATTTGGC 
TTATATTATG GACTGTTTGT GTAGCAGAAG ATTGTAAAGG TCCTCCTCCA AGAGAAAATT 
CAGAAATTCT CTCAGGTTCG TGGTCTGAAC AACTATATTC AGAAGGCACT CAGGCAACCT 
• ACAAATGCCG CCCTGGATAC CGAACACTTC GTACTATTGT AAAAGTATGC AAGAATGGAG 
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AATGGGTACC TTCTAACCCA TCAAGGATAT GTCGGAAAAG GCCATGTGGG CATCCCGGAG 300 

ACACACCCTT TGGGTCCTTT AGGCTGGCAG TTGGATCTGA ATTTGAATTT GGTGCAAAGG 360 

TTGTTTATAC ATGTGATGAA GGGTACCAAC TATTAGGTGA AATTGATTAC CGTTATCGAA 420 

TGGATGGCTC TGACATTGTC ACATGTGTTA ATACGAAGTG GATTGGACAG CCGGTATGCA 480 

AAGATAATTC CTGTGTGAAT CCACCACATG TGCCAAATGC TACTATACTA ACAAGGCACA 54 0 

AGACTAAATA TCCATCTGGT GACAAAGTAC GTTATGACTG TAATAAACCT TTTGAATTAT 600 

TTGGGGAAGT GGAAGTGATG TGCCAAAACG GGATTTGGAC AGAAC CACCG AAATGCAAAG 66 0 

ATTCAACAGG GAAATGTGGG CCTCCTCCAC CTATTGACAA TGGAGACATC ACCTCCTTGT 72 0 

CATTACCAGT ATATGCACCA TTATCATCAG TTGAATATCA ATGCCAGAAC TATTATCTAC 78 0 

TTAAGGGAAA TAAGATAGTA ACATGTAGAA ATGGAAAGTG GTCTCAGCCA CCAACCTGCT 84 0 

TACATGCATG TGTGATACCA GAAGATATTA TGGAAAAACA TAATATAGTT CTCAGATGGA 90 0 

GGGAAAATGC AAAGATTTAT TCCCAATCAG GGGAGAATAT TGAATTCATG TGTAAACCTG 96 0 

GATATAGAAA ATTCAGAGGA TCACCTCCGT TTCGTACAAA GTGCATTGAG GGTCACATCA 102 0 

ATTATCCCAC TTGTGTATAA AATCGCTATA CAATTATTAG TAAACCTTAT GGATGAGAAA 1080 

TGCACATGTA TATTACTAAT ACAGTTTGAA TTTACATTTA AATATTGTTT AGCTCATTTC 114 0 

CTCTAATAAG TATATAAACT TTTTTTATAT GGTGGTTAAT CAGTAACTTT ACAGACTGTT 12 00 

GCCACAAAGC AAGAACATTA CATTCAAAAC TC CTAATC CA AATATGATAT GTC CAAGGAC 126 0 

AAACTATGTC TAAGCAAGAA AATAAATGTT AGTTCTTCAA TGTCTGTTTT TATTCAGGAC 132 0 

CTTTCAGATT TTCTTGGATA CCTTTTGTTA GGTTCTGATT CACAGTGAGT GGAAGACACA 13 8 0 

CTGACTCTGA CTTCAAATTA GTATTACTTG CAATACATTA ACAACCAAAC TATCATAATA 144 0 

TCACAAATGT ATAC AG CTAA TTACTGTGTC CTACCTTTGT ATCAATAAAG AAATCTAAGA 1500 

AAGTTCTTGC TTAAAAAAAA AAAAAAAAAA AA 1532 
(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

TTCAAGTAAC GTTAGAAGCT TAAGATG - 27 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 
GGCGGCCGCT CAAATCTTCT GAGATATAGG AGA 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 
GGCGGCCGCT CATTTAATCC TTAAAGGTGA GTA 
(2) INFORMATION FOR SEQ ID NO : 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGCGGCCGCT CATACTGGAA AGTATGGTCT ACG 
(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 

Glu Asp Cys Asn Glu Leu Pro Pro Arg Arg Asn Thr Glu He Leu Thr 
15 10 15 

Gly Ser Trp Ser Asp Gin Thr Tyr Pro Glu Gly Thr Gin Ala He Tyr 
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Lys Cys Arg Pro Gly Tyr Arg Ser Leu Gly Asn Val lie Met Val Cys 
35 40 45 

Arg Lys Gly Glu Trp Val Ala Leu Asn Pro Leu Arg Lys Cys Gin Lys 
50 55 60 

Arg Pro Cys Gly' His Pro Gly Asp Thr Pro Phe Gly Thr Phe Thr Leu 
65 70 75 80 

Thr Gly Gly Asn Val Phe Glu Tyr Gly Val Lys Ala Val Tyr Thr Cys 

85 90 ^5 

Asn Glu Gly Tyr Gin Leu Leu Gly Glu He Asn Tyr Arg Glu Cys Asp 

100 105 HO 

Thr Asp Gly Trp Thr Asn Asp He Pro He Cys Glu Val Val Lys Cys 
115 120 125 

Leu Pro Val Thr Ala Pro Glu Asn Gly Lys He Val Ser Ser Ala Met 
130 135 140 

Glu Pro Asp Arg Glu Tyr His Phe Gly Gin Ala Val Arg Phe Val Cys 
145 150 155 160 

Asn Ser Gly Tyr Lys He Glu Gly Asp Glu Glu Met His Cys Ser Asp 

165 170 175 

Asp Gly Phe Trp Ser Lys Glu Lys Pro Lys Cys Val Glu He Ser Cys 

180 185 1^0 

Lys Ser Pro Asp Val He Asn Gly Ser Pro He Ser Gin Lys He 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Glu Asp Cys Asn Glu Leu Pro Pro Arg Arg Asn Thr Glu He Leu Thr 
1 5 10 15 

Gly Ser Trp Ser Asp Gin Thr Tyr Pro Glu Gly Thr Gin Ala He Tyr 

20 25 30 

Lys Cys Arg Pro Gly Tyr Arg Ser Leu Gly Asn Val He Met Val Cys 
35 40 45 



Arg Lys Gly Glu Trp Val Ala Leu Asn Pro Leu Arg Lys Cys Gin Lys 
50 55 60 
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Arg Pro Cys Gly His Pro Gly Asp Thr Pro Phe Gly Thr Phe Thr Leu 
65 70 75 80 

Thr Gly Gly Asn Val Phe Glu Tyr Gly Val Lys Ala Val Tyr Thr Cys 

85 90 95 

Asn Glu Gly Tyr Gin Leu Leu Gly Glu He Asn Tyr Arg Glu Cys Asp 

100 105 HO 

Thr Asp Gly Trp Thr Asn Asp He Pro He Cys Glu Val Val Lys Cys 
115 120 I 25 

Leu Pro Val Thr Ala Pro Glu Asn Gly Lys He Val Ser Ser Ala Met 
130 135 140 

Glu Pro Asp Arg Glu Tyr His Phe Gly Gin Ala Val Arg Phe Val Cys 
145 150 155 160 

Asn Ser Gly Tyr Lys He Glu Gly Asp Glu Glu Met His Cys Ser Asp 

165 170 I 75 

Asp Gly Phe Trp Ser Lys Glu Lys Pro Lys Cys Val Glu He Ser Cys 

180 185 190 

Lys Ser Pro Asp Val He Asn Gly Ser Pro He Ser Gin Lys He He 
195 200 205 

Tyr Lys Glu Asn Glu Arg Phe Gin Tyr Lys Cys Asn Met Gly Tyr Glu 
210 215 220 

Tyr Ser Glu Arg Gly Asp Ala Val Cys Thr Glu Ser Gly Trp Arg Pro 
225 230 235 240 

Leu Pro Ser Cys Glu Glu Lys Ser Cys Asp Asn Pro Tyr He Pro Asn 

245 250 255 

Gly Asp Tyr Ser Pro Leu Arg He Lys 

260 265 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : 

(D) TOPOLOGY : unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Glu Asp Cys Asn Glu Leu Pro Pro Arg Arg Asn Thr Glu He Leu Thr 
15 10 15 

Glv Ser Trp Ser Asp Gin Thr Tyr Pro Glu Gly Thr Gin Ala He Tyr 

20 25 30 
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Lys Cys Arg Pro Gly Tyr Arg Ser Leu Gly Asn Val lie Met Val Cys 
35 40 45 

Arg Lys Gly Glu Trp Val Ala Leu Asn Pro Leu Arg Lys Cys Gin Lys 
50 55 60 

Arg Pro Cys Gly His Pro Gly Asp Thr Pro Phe Gly Thr Phe Thr Leu 
65 70 75 80 

Thr Gly Gly Asn Val Phe Glu Tyr Gly Val Lys Ala Val Tyr Thr Cys 

85 90 9$ 

Asn Glu Gly Tyr Gin Leu Leu Gly Glu lie Asn Tyr Arg Glu Cys Asp 

100 105 HO 

Thr Asp Gly Trp Thr Asn Asp lie Pro lie Cys Glu Val Val Lys Cys 
115 120 125 

Leu Pro Val Thr Ala Pro Glu Asn Gly Lys lie Val Ser Ser Ala Met 
130 135 140 

Glu Pro Asp Arg Glu Tyr His Phe Gly Gin Ala Val Arg Phe Val Cys 
145 150 155 160 

Asn Ser Gly Tyr Lys lie Glu Gly Asp Glu Glu Met His Cys Ser Asp 

165 170 175 

Asp Gly Phe Trp Ser Lys Glu Lys Pro Lys Cys Val Glu lie Ser Cys 

180 185 190 

Lys Ser Pro Asp Val lie Asn Gly Ser Pro lie Ser Gin Lys lie lie 
195 200 205 

Tyr Lys Glu Asn Glu Arg Phe Gin Tyr Lys Cys Asn Met Gly Tyr Glu 
210 215 220 

Tyr Ser Glu Arg Gly Asp Ala Val Cys Thr Glu Ser Gly Trp Arg Pro 
225 230 235 240 

Leu Pro Ser Cys Glu Glu Lys Ser Cys Asp Asn Pro Tyr lie Pro Asn 

245 250 255 

Gly Asp Tyr Ser Pro Leu Arg lie Lys His Arg Thr Gly Asp Glu lie 

260 265 270 

Thr Tyr Gin Cys Arg Asn Gly Phe Tyr Pro Ala Thr Arg Gly Asn Thr 
275 280 285 

Ala Lys Cys Thr Ser Thr Gly Trp He Pro Ala Pro Arg Cys Thr Leu 
290 295 300 

Lys Pro Cys Asp Tyr Pro Asp He Lys His Gly Gly Leu Tyr His Glu 
305 ' 310 315 320 



Asn Met Arg Arg Pro Tyr Phe Pro Val 

325 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : unknown 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCTCCTCCTG GAAATGTTAG AAGCTTAAGA TG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCTCTAGATT ACTTGATACG GACGCATTT 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D ) TOPOLOGY : unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Glu Asp Cys Lys Gly Pro Pro Pro Arg Glu Asn Ser Glu He Leu Ser 
15 10 15 

Gly Ser Trp Ser Glu Gin Leu Tyr Ser Glu Gly Thr Gin Ala Thr Tyr 

20 25 30 

Lys Cys Arg Pro Gly Tyr Arg Thr Leu Gly Thr He Val Lys Val Cys 
35 40 45 

Lys Asn Gly Glu Trp Val Pro Ser Asn Pro Ser Arg He Cys Arg Lys 
50 55 60 

Arg Pro Cys Gly His Pro Gly Asp Thr Pro Phe Gly Ser Phe Arg Leu 
65 70 75 80 

Ala Val Gly Ser Glu Phe Glu Phe Gly Ala Lys Val Val Tyr Thr Cys 

85 90 95 

Asp Glu Gly Tyr Gin Leu Leu Gly Glu He Asp Tyr Arg Glu Cys Asp 
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f 
v. 



100 105 .H° 

Ala Asp Gly Trp Thr Asn Asp lie Pro He Cys Glu Val Val Lys Cys 
115 120 125 

Leu Pro Val Thr Glu Leu Glu Asn Gly Arg He Val Ser Gly Ala Ala 
130 135 140 

Glu Pro Asp Gin Glu Tyr Tyr Phe Gly Gin Val Val Arg Phe Glu Cys 
145 150 155 160 

Asn Ser Gly Phe Lys He Glu Gly Gin Lys Glu Met His Cys Ser Glu 

165 170 175 

Asn Gly Leu Trp Ser Asn Glu Lys Pro Gin Cys Val Glu He Ser Cys 

180 185 190 

Leu Pro Pro Arg Val Glu Asn Gly Asp Gly He Tyr Leu Lys Pro Val 
195 200 205 

Tyr Lys Glu Asn Glu Arg Phe Gin Tyr Lys Cys Lys Gin Gly Phe Val 
210 215 220 

Tyr Lys Glu Arg Gly Asp Ala Val Cys Thr Gly Ser Gly Trp Asn Pro 
225 230 235 240 

Gin Pro Ser Cys Glu Glu Met Thr Cys Leu Thr Pro Tyr He Pro Asn 

245 250 255 

Gly He Tyr Thr Pro His Arg He Lys His Arg He Asp Asp Glu He 

260 265 270 

Arg Tyr Glu Cys Lys Asn Gly Phe Tyr Pro Ala Thr Arg Ser Pro Val 
275 280 285 

Ser Lys Cys Thr He Thr Gly Trp He Pro Ala Pro Arg Cys Ser Leu 
290 295 300 

Lys Pro Cys Asp Phe Pro Gin Phe Lys His Gly Arg Leu Tyr Tyr Glu 
305 310 315 320 

Glu Ser Arg Arg Pro Tyr Phe Pro Val Pro He Gly Lys Glu Tyr Ser 

325 330 335 

Tyr Tyr Cys Asp Asn Gly Phe Thr Thr Pro Ser Gin Ser Tyr Trp Asp 

340 345 350 

Tyr Leu Arg Cys Thr Val Asn Gly Trp Glu Pro Glu Val Pro Cys Leu 
355 360 365 

Arg Gin Cys He Phe His Tyr Val Glu Tyr Gly Glu Ser Ser Tyr Trp 
370 375 380 

Gin Arg Arg Tyr He Glu Gly Gin Ser Ala Lys Val Gin Cys His Ser 
385 390 395 400 
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Gly Tyr Ser Leu Pro Asn Gly Gin Asp Thr Tyr Tyr Cys Thr Glu Asn 

405 410 415 



Gly Trp Ser Pro Pro Pro Lys Cys Val Arg lie Lys 

420 425 
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Figure 1 

10 20 30 -18 40 50 60 

tcgagtcaactgctcccagatagatccaagac^XSAGACTGTCAGCAAGAATTATTTGGC rFH4 . 3 

tcgagtcaactgctcccagatagatccaagacMSAGACTGTCAGCAAGAATTATTTGGC rFH2 . 7 

tcgagtcaactgctcccagatagatccaagac^ISAGACTGTCAGC^GAATTATTTGGC rFHl . 8 

tcgagtcaactgctcccaga tagatccaag rFHl . 0 



SCRl 

70 80 +1 90 100 110 120 

TTATATTATGGACTGTTTGTGTAGCAS^aGATTGTAAAGGTCCTCCTCCAAGAGAAAATT rFH4 . 3 

TTATATTATGGACTGTTTGTGTAGCAC^GATTGTAAAGGTCCTCCTCCAAGAGAAAATT rFH2 . 7 

TTATATTATGGACTGTTTGTGTAGCASSAGATTGTAAAGGTCCTCCTCCAAGAGAAAATT rFHl . 8 

TTATATTATGGACTGTTTGTGTAGCAGA^GATTGTAAAGGTCCTCCTCCAAGAGAAAATT rFHl . 0 



130 140 150 160 170 180 

CAGAAATTCTCTCAGGTTCGTGGTCTGAACAACTATATTCAGAAGGCACTCAGGC^^CCT rFH4 . 3 

CAGAAATTCTCTCAGGTTCGTGGTCTGAACAACTATATTCAGAAGGCACTCAGGCAACCT rFH2 . 7 

CAGAAATTCTCTCAGGTTCGTGGTCTGAACAACTATATTCAGAAGGCACTCAGGCAACCT rFHl . 8 

CAGAAATTCTCTCAGGTTCGTGGTCTGAACAACTATATTCAGAAGGCACTCAGGCAACCT rFHl . 0 



190 200 210 220 230 240 

ACAAATGCCGCCCTGGATACCGAACACTTGGTACTATTGTAAAAGTATGCAAGAATGGAG rFH4 . 3 

ACAAATGCCGCCCTGGATACCGAACACTTGGTACTATTGTAAAAGTATGCAAGAATGGAG rFH2 . 7 

ACAAATGCCGCCCTGGATACCGAACACTTGGTACTATTGTAAAAGTATGCAAGAATGGAG rFHl . 8 

ACAAATGCCGCCCTGGATACCGAACACTTGGTACTATTGTAAAAGTATGCAAGAATGGAG rFHl . 0 



SCR2a 

250 260 270 280 290 300 

AATGGGTACCTTCTAACCCATCAAGGATATGTCGGAAAAGGCCATGTGGGCATCCCGGAG rFH4 . 3 
AATGGGTACCTTCTAACCCATCAAGGATATGTCGGAAAAGGCCATGTGGGCATCCCGGAG rFH2 . 7 
AATGGGTACCTTCTAACCCATCAAGGATATGTCGGAAAAGGCCATGTGGGCATCCCGGAG rFHl . 8 
AATGGGTACCTTCTAACCCATCAAGGATATGTCGGAAAAGGCCATGTGGGCATCCCGGAG rFHl . 0 



310 320 330 340 350 360 

ACACACCCTTTGGGTCCTTTAGGCTGGCAGTTGGATCTGAATTTGAATTTGGTGCAAAGG rFH4 . 3 

ACACACCCTTTGGGTCCTTTAGGCTGGCAGTTGGATCTGAATTTGAATTTGGTGCAAAGG rFH2 . 7 

ACACACCCTTTGGGTCCTTTAGGCTGGCAGTTGGATCTGAATTTGAATTTGGTGCAAAGG rFHl . 8 

ACACACCCTTTGGGTCCTTTAGGCTGGCAGTTGGATCTGAATTTGAATTTGGTGCAAAGG rFHl . 0 
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370 380 390 400 410 420 

TTGTTTATACATGTGATGJ^GGGTACC^CTATTAGGTGAAATTGATTACCGTGAATGTG rFH4 . 3 
TTGTTTATACATGTGATGAAGGGTACCAACTATTAGGTGAAATTGATTACCGTGAATGTG r FH2 . 7 

TTGTTTATACATGTGATGAAGGGTACCAACTATTAGGTGAAATTGATTACCGT r FH1 . 8 

TTGTTTATACATGTGATGAAGGGTACCAACTATTAGGTGAAATTGATTACCGTGAATGTG rFHl . 0 

SCR3 

430 440 450 " 460 470 480 

ATGCAGATGGGTGGACCAATGATATTCCAATATGTGAAGTTGTGAAGTGCTTGCCAGTGA rFH4 . 3 
ATGCAGATGGGTGGACCAATGATATTCCAATATGTGAAGTTGTGAAGTGCTTGCCAGTGA rFH2 . 7 

rFHl . 8 

ATGCAGATGGGTGGACCAATGATATTCCAATATGTGAAGTTGTGAAGTGCTTGCCAGTGA rFHl . 0 



490 500 510 520 530 540 

CAGAACTGGAGAATGGAAGAATTGTGAGTGGTGCAGCCGAACCAGACCAGGAATATTATT rFH4 . 3 

CAGAACTGGAG AATGGAAGAATTGTG AGTGGTG CAGC CGAAC CAGAC CAGG AAT ATT ATT rFH2 - 7 

rFHl . 8 

CAGAACTGGAGAATGGAAGAATTGTG AGTGGTG CAGCCGAAC CAGACCAGGAATATTATT rFHl . 0 



550 560 570 580 590 600 

TTGGACAGGTGGTACGCTTTGAATGCAACTCCGGCTTCAAGATTGAAGGACAGAAAGAAA rFH4 . 3 
TTGGACAGGTGGTACGCTTTGAATGCAACTCCGGCTTCAAGATTGAAGGACAGAAAGAAA rFH2 . 7 

rFHl . 8 

TTGGACAGGTGGTACGCTTTGAATGCAACT C CGGCTTCAAGATTGAAGGACAGAAAGAAA rFHl . 0 



SCR4 

610 620 630 640 650 660 

TGCACTGCTCATAAAATGGC CT CTGGAG CAATGAAAAGCCACAGTGTGTGGAAATTTCTT rFH4 . 3 

TGCACTGCTCATAAAATGGCCTCTGGAGCAATGAAAAGCCACAGTGTGTG rFH2 . 7 

rFHl . 8 

TGCACTGCTCATAAAATGGCCTCTGGAGCAATGAAAAGCCACAGTGTGTGGAAATTTCTT rFHl . 0 



670 680 690 700 710 720 

G C CTGCCAC CACGAGTTGAAAATGGAGATGGTATATATCTGAAACCAGTTTACAAGGAGA rFH4 . 3 

rFH2 . 7 

rFHl . 8 

GCCTGCCACCACGAGTTGAAAATGGAGAT rFHl . 0 
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730 740 750 760 770 780 

ATGAAAGATTCCAATATAAATGTAAGCAAGGTTTTGTGTACAAAGAAAGAGGGGATGCTG rFH4 . 3 

rFH2 . 7 

rFHl . 8 

rFHl . 0 



SCR5 

790 800 810 820 830 840 

TCTGCACGGGTTCTGGATGGAATCCTCAGCCTTCCTGTGAAGAAATGACATGTTTGACTC rFH4 . 3 

rFH2 . 7 

rFHl . 8 

rFHl . 0 



850 860 870 880 890 900 
CATATATTCCAAATGGTATCTACACACCTCACAGGATTAAACACAGAATTGATGATGAAA rFH4 . 3 
_ rFH2.7 

rFHl . 8 

rFHl . 0 



910 920 930 940 950 960 

TCAGATATGAATGTAAAAATGGCTTCTATCCTGCAACCCGATCACCTGTTTCAAAGTGTA rFH4 . 3 

_ rFH2 . 7 

rFHl . 8 

rFHl . 0 



SCR.6 

970 980 990 1000 1010 1020 

CAATTACTGGCTGGATCCCTGCTCCAAGATGTAGCTTGAAACCTTGTGATTTTCCACAAT rFH4 . 3 

TTGAAACCTTGTGATTTTCCACAAT rFH2 . 7 

rFHl . 8 

rFHl . 0 



103 0 1040 1050 1060 1070 1080 

TCAAACATGGACGTCTGTATTATGAAGAAAGCCGGAGACCCTACTTCCCAGTACCTATAG rFH4 . 3 
TCAAACATGGACGTCTGTATTATGAAGAAAGCCGGAGACCCTACTTCCCAGTACCTATAG rFH2 . 7 

rFHl . 8 
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1090 1100 1110 1120 1130 1140 

GAAAGGAGTACAGCTATAACTGTGACAACGGGTTTACAACGCCTTCACAGTCATACTGGG rFH4 . 3 
GAAAGGAGTACAGCTATAACTGTGACAACGGGTTTACAACGCCTTCACAGTCATACTGGG rFH2 . 7 

. rFH1.8 

rFHl . 0 



SCR 7 

1150 1160 1170 1180 1190 1200 

ACTACCTTCGTTGCACAGTAAATGGGTGGGAGCCTGAAGTTCCATGCCTCAGGCAATGTA rFH4 . 3 
ACTACCTTCGTTGCACAGTAAATGGGTGGGAGCCTGAAGTTCCATGCCTCAGGCAATGTA ^ FH2 - 7 

. rFHl. 8 

rFHl . 0 



1210 1220 1230 1240 1250 1260 

TTTTCCATTATGTGGAATATGGAGAATCTTCATACTGGCAAAGAAGATATATAGAGGGTC rFH4 . 3 
TTTTCCATTATGTGGAATATGGAGAATCTTCATACTGGCAAAGAAGATATATAGAGGGTC rFH2 . 7 

rFHl. 8 

. rFHl . 0 



1270 1280 1290 13 00 1310 132 0 

AGTCTGCAAAAGTCCAGTGTCACAGTGGCTATAGTCTTCCAAATGGTCAAGATACATATT rFH4 . 3 
AGTCTGCAAAAGTCCAGTGTCACAGTGGCTATAGTCTTCCAAATGGTCAAGATACATATT rFH2 . 7 

rFHl . 8 

rFHl . 0 



SCRB 

1330 1340 1350 1360 1370 1380 

ATTGTACAGAGAATGGCTGGTCCCCTCCTCCCAAATGCGTCCGTATCAAGACTTGTTCAG rFH4 . 3 
ATTGTACAGAGAATGGCTGGTCCCCTCCTCCCAAATGCGTCCGTATCAAGACTTGTTCAG rFH2 . 7 

rFHl. 8 

rFHl . 0 



13 90 1400 1410 1420 1430 1440 

TATCAGATATAGAAATTGAAAATGGGTTTTTTTCTGAATCTGATTATACATATGCTCTAA rFH4 . 3 
TATCAGATATAGAAATTGAAAATGGGTTTTTTTCTGAATCTGATTATACATATGCTCTAA rFH2 . 7 

rFHl. 8 
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1450 1460 1470 1480 1490 1500 

ATAGAAAAACACGGTATAGATGTAAACAGGGATATGTAACAAATACCGGAGAAATATCAG rFH4 . 3 
ATAGAAAAACACGGTATAGATGTAAACAGGGATATGTAACAAATACCGGAGAAATATCAG rFH2 . 7 

. rFHl. 8 

rFHl.O 



SCR9 

1510 1520 1530 1540 1550 1560 

GAATAATTACTTGTCTTCAAGATGGATGGTCACCTCGACCCTCATGCATTAAGTCTTGTG rFH4 . 3 
GAATAATTACTTGTCTTCAAGATGGATGGTCACCTCGACCCTCATGCATTAAGTCTTGTG rFH2 . 7 

rFH1.8 

rFHl . 0 



1570 1580 1590 1600 1610 1620 

ATATGCCTGTATTTGAGAATTCTATGACTAAGAATAATAACACATGGTTTAAACTCAATG rFH4 . 3 
ATATGC CTGTATTTGAGAATTCTATGACTAAG AATAATAAC ACATGGTTT AAACT CAAT G rFH2 . 7 

- rFHl . 8 

rFHl . 0 



1630 1640 1650 1660 1670 1680 

ACAAATTAGACTATGAATGTCACATTGGATATGAAAATGAATATAAACATACCAAAGGCT rFH4 . 3 

ACAAATTAGACTA': 



iTGAATGTCACATTGGATATGAAAATGAATATAAACATACCAAAGGCT rFH2 . 7 



rFHl. 8 
rFHl.O 



SCR10 

1690 1700 1710 1720 1730 1740 

CTATAACATGTACTTATGATGGATGGTCTAGTACACCCTCCTGTTATGAAAGAGAATGCA rFH4 . 3 
CTATAACATGTACTTATGATGGATGGTCTAGTACACCCTCCTGTTATGAAAGAGAATGCA rFH2 . 7 

rFHl. 8 

rFHl . 0 



1750 1760 1770 1780 1790 1800 

GCATTCCCCTGTTACACCAAGACTTAGTTGTTTTTCCCAGAGAAGTAAAATACAAAGTTG rFH4 . 3 
GC^TTCCCCTGTTACACCAAGACTTAGTTGTTTTTCCCAGAGAAGTAAAATACAAAGTTG rFH2 . 7 

rFHl . 8 
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1810 1820 1830 1840 1850 I860 

GAGATTCGTTGAGTTTCTCTTGCCGTTCAGGACACAGAGTTGGAGCAGATTTAGTGCAAT rFH4 . 3 
GAGATTCGTTGAGTTTCTCTTGCCGTTCAGGACACAGAGTTGGAGCAGATTTAGTGCAAT rFH2 . 7 

rFHl. 8 

r FH1 . 0 



SCRll 

1870 1880 1890 1900 1910 1920 

GCTACCACTTTGGATGGTCCCCTAATTTCCCAACGTGTGAAGGCCAAGTAAAATCATGTG rFH4 . 3 
GCTACCACTTTGGATGGTCCCCTAATTTCCCAACGTGTGAAGGCCAAGTAAAATCATGTG rFH2 . 7 

; - rFH1.8 

rFHl.O 



1930 1940 1950 1960 1970 1980 

ACCAACCTCTTGAAATCCCGAATGGGGAAATAAAGGGAACAAAAAAAGTTGAATACAGCC rFH4 . 3 
ACCAACCTCTTGAAATCCCGAATGGGGAAATAAAGGGAACAAAAAAAGTTGAATACAGCC rFH2 . 7 

rFHl . 8 

rFHl . 0 



1990 2000 2010 2020 2030 2040 

ATGGTGACGTGGTGGAATATGATTGCAAACCTAGATTTCTACTGAAGGGACCCAATAAAA rFH4 . 3 
ATGGTGACGTGGTGGAATATGATTGCAAACCTAGATTTCTACTGAAGGGACCCAATAAAA rFH2 . 7 

rFHl. 8 

_ rFHl . 0 



SOU 2 

2050 2060 2070 2080 2090 2100 

TCCAGTGTGTTGACGGGAAGTGGACAAGGTTGCCGATATGCGTTGAGTATGAGAGAACAT rFH4 . 3 
TCCAGTGTGTTGACGGGAAGTGGACAAGGTTGCCGATATGCGTTGAGTATGAGAGAACAT rFH2 . 7 

rFHl. 8 

rFHl.O 



2110 2120 213 0 2140 2150 2160 

GTGGAGACCTTCCTGAACTTGAGCATGGCTCTGTCAAGTTATCTGTCCCTCCCTACCATC rFH4 . 3 
GTGGAGACCTTCCTGAACTTGAGCATGGCTCTGTCAAGTTATCTGTCCCTCCCTACCATC rFH2 . 7 

rFHl. 8 
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2170 2180 2190 2200 2210 222 0 

ATGGAGATTCAGTGGAGTTCACTTGTACA rFH4 . 3 

ATGGAGATTCAGTGGAGTTCACTTGTACAGAAACCTTCACAATGATTGGACATGCAGTAG rFH2 . 7 

rFHl . 8 

rFHl.O 



SCR13 

2230 2240 2250 2260 2270 2280 

TTTTCTGCATTAGTGGAAGGTGGACCGAGCTTCCTCAATGTGTTGCAACAGATCAACTGG rFH4 . 3 
TTTTCTGCATTAGTGGAAGGTGGACCQAGCTTCCTCAATGTGTTGCAACAGATCAACTGG rFH2 . 7 

; - - rFHl . 8 

rFHl . 0 



2290 2300 2310 2320 2330 2340 

AGAAGTGTAAAGCCCCGAAGTCAACTGGCATAGATGCAATTCATCCAAATAAGAATGAAT rFH4 . 3 
AGAAGTGTAAAGCCCCGAAGTCAACTGGCATAGATGCAATTCATCCAAATAAGAATGAAT rFH2 . 7 

rFHl . 8 

rFHl.O 



2350 2360 2370 2380 2390 2400 

TTAATCATAACTTTAGTGTGAGTTACAGATGTAGACAAAAGCAGGAGTATGAACATTCAA rFH4 . 3 
TTAATCATAACTTTAGTGTGAGTTACAGATGTAGACAAAAGCAGGAGTATGAACATTCZAA rFH2 . 7 

rFHl . 8 

rFHl.O 



SCR14 

2410 2420 2430 2440 2450 2460 

TCTGCATCAATGGAAGATGGGATCCTGAACCAAACTGTACAAGCAAAAGATTCTGCCCTC rFH4 . 3 

TCTGCATCAATGGAAGATGGGATCCTGAACCAAACTGTACAAGCAAAAGATTCTGCCCTC rFH2 . 7 

rFHl . 8 

rFHl . 0 



2470 2480 2490 2500 2510 2520 

CTCCCCCGCAGATTCCAAATGCCC^GTGATTGAAACCACCGTGAAATACTTGGATGGAG rFH4 . 3 
CTCCCCCGCAGATTCCAAATGCCCAAGTGATTGAAACCACCGTGAAATACTTGGATGGAG rFH2 - 7 

rFHl . 8 
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2530 2540 2550 2560 2570 2580 

AAAAAGTATCTGTTCTTTGCCAAGATGGTTACCTAACTCAGGGCCCAGAAGAAATGGTGT rFHl . 8 
AAAAAGTATCTGTTCTTTGCCAAGATGGTTACCTAACTCAGGGCCCAGAAGAAATGGTGT rFH2 . 7 

.. rFH4 .3 

rFHl . 0 



sous 

2590 2600 2610 2620 2630 2640 

GTAAACATGGAAGGTGGCAGTCGTTACCACGCTGCACGGAAAAAATTCCATGTTCCCAGC rFH4 . 3 
GTAAACATGGAAGGTGGCAGTCGTTACCACGCTGCACGGAAAAAATTCCATGTTCCCAGC rFH2 . 7 

rFHl . 8 

rFHl . 0 



2650 2660 2670 268 0 2690 2700 

CCCCTAAAATTGAACATGGATCTATTAAGTCGCCCAGGTCCTCAGAAGAGAGGAGAGATT rFH4 . 3 
CCCCTAAAATTGAACATGGATCTATTAAGTCGCCCAGGTCCTCAGAAGAGAGGAGAGATT rFH2 . 7 

rFHl . 8 

rFHl . 0 



2710 2720 2730 2740 2750 2760 

TAATTGAGTCCAGCAGTTATGAACACGGAACTACATTCAGCTATTGCTGTAGAGATGGAT rFH4 . 3 
TAATTGAGTCCAGCAGTTATGAACACGGAACTACATTCAGCTATTGCTGTAGAGATGGAT rFH2 . 7 

rFHl. 8 

rFHl . 0 



2770 2780 2790 2800 2810 2820 

TCAAGATATCTGAAGAAAATAGGGTAACCTGCAACATGGGAAAATGGAGCTCTCTGCCTC rFH4 . 3 
TCAAGATATCTGAAGAAAATAGGGTAACCTGCAACATGGGAAAATGGAGCTCTCTGCCTC rFH2 . 7 

rFHl . 8 

__ rFHl.O 



SOU 6 

2830 2840 2850 2860 2870 2880 

GTTGTGTTGGAATACCTTGTGGACCCCCACCTTCAATTCCTCTTGGTATTGTTTCTCATG rFH4 . 3 

GTTGTGTTGGAAT AC CTTGTGGAC C C C CAC CTTCAATTC CT CTTGGTATTGTTT CTCATG rFH2 . 7 

rFHl . 8 
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2890 2900 2910 2920 2930 2940 

AACTAGAAAGTTAC CAATATGGAGAGGAGGTT ACATACAATTGTTCTGAAGG CTTTGGAA rFH4 . 3 

JTTACCAATATGGAGAGGAGGTTACATACAATTGTTCTGAAGGCTTTGGAA rFH2 . 7 



AACTAGAAAG' 



rFH1.8 
rFHl.O 



2950 2960 2970 2980 2990 3000 

TTGATGGACCAGC^TTTATTAAATGTGTAGGAGGACAGTGGTCTGAACCTCCCAAATGCA rFH4 . 3 
TTGATGGACCAGCATTTATTAAATGTGTAGGAGGACAGTGGTCTGAACCTCCCAAATGCA rFH2 . 7 

rFHl . 8 



rFHl. 0 



SCR17 

3010 3020 3030 3040 3050 3060 

TAAAAACTGATTGTGACAACTTGCCCACATTTGAAATTGCCAAACCGACAGAAAAGAAAA rFH4 . 3 

TAAAAACTGATTGTGACAACTTG C C CACATTTGAAATTGC CAAAC C GACAGAAAAGAAAA rFH2 . 7 

rFHl . 8 



rFHl.O 



3070 3080 3090 3100 3110 3120 

AAAAATCATACAGG 



TCAGGAGAACAAGTGACATTCAGATGTCCACCTCCGTATCGAATGG rFH4 . 3 



rFH2 . 7 

TATCGAATGG rFHl . 8 
- rFHl.O 



ATGGCTCTGA 



3130 3140 3150 3160 3170 3180 

CATTGTCACATGTGTTAATACGAAGTGGATTGGACAGCCGGTATGCAAAG rFH4 . 3 



rFH2.7 
rFHl. 8 



ATGGCTCTGACATTGTCACATGTGTTAATACGAAGTGGATTGGACAGCCGGTATGCAAAG 

rFHl . 0 



SC3U8 



3190 3200 3210 3220 3223 3240 

ATAATTCCTGTGT 



GAATCCACCACATGTGCCAAATGCTACTATACTAACAAGGCACAAGA rFH4 . 3 



- - rFH2.7 

AT AATT C CTGTGTGAAT C CACCACATGTG C CAAATGCT ACTATACTAACAAGG CACAAGA rFHl . 8 

rFHl . 0 



WO 98/23638 



- 10/15 - 



PCT/GB97/03275 



3250 3260 3270 3280 3290 3300 
CTAAATATCCATCTGGTGACAAAGTACGTTATGACTGTAATAAACCTT1TGAATTATTTG rFH4 . 3 
rFH2.7 

CTAAATATCCATCTGGTGACAAAGTACGTTATGACT rFHl . 8 
rFHl . 0 



3310 3320 3330 3340 3350 3360 

GGGAAGTGGAAGTGATGTGCCAAAACGGGATTTGGACAGAACCACCGAAATGCAAAGATT rFH4 . 3 

rFH2 . 7 

GGGAAGTGGAAGTGATGTGCCAAAACGGGATTTGGACAGAACCACCGAAATGCAAAGATT rFHl . 8 

rFHl . 0 



SCR19 

3370 3380 3390 3400 3410 3420 

CAACAGGGAAATGTGGGCCTCCTCCACCTATTGACAATGGAGACATCACCTCCTTGTCAT rFH4 . 3 

rFH2 . 7 

CAACAGGGAAATGTGGGCCTCCTCCACCTATTGACAATGGAGACATCACCTCCTTGTCAT rFHl . 8 

rFHl . 0 



3430 3440 3450 3460 3470 3480 

TACCAGTATATGCACCATTATCATCAGTTGAATATCAATGCCAGAACTATTATCTACTTA rFH4 . 3 

rFH2 . 7 

TACCAGTATATGCACCATTATCATCAGTTGAATATCAATGCCAGAACTATTATCTACTTA rFHl . 8 

rFHl . 0 



3490 3500 3510 3520 3530 3540 
AGGGAAATAAGATAGTAACATGTAGAAATGGAAAGTGGTCTCAGCCACCAACCTGCTTAC rFH4 . 3 
rFH2.7 

AGGGAAATAAGATAGTAACATGTAGAAATGGAAAGTGGTCTCAGCCACCAACCTGCTTAC rFHl . 8 

rFHl . 0 



SCR20 

3550 3560 3570 3580 3590 3600 
ATGCATGTGTGATAC CAGAAGATATT ATGGAAAAACATAATATAGTT CT CAGATGGAGGG rFH4 . 3 
_ _ rFH2.7 

ATGCATGTGTGATACCAGAAGATATTATGGAAAAACATAATATAGTTCTCAGATGGAGGG rFHl . 8 

rFHl.O 



WO 98/23638 



- 11/15 



PCT/GB97/03275 



3610 3620 363 0 3640 3650 3660 

AAAATGCAAAGATTTATTCCCAA.TCAGGGGAGAATATTGAATTCATGTGTAAACCTGGAT rFH4 . 3 

rFH2.7 

AAAATGCAAAGATTTATT C C CAAT CAGGGGAGAATATTGAATTCATGTGTAAAC CTGGAT rFHl . 8 

GGAT rFHl.O 



3670 3680 3690 3700 3710 3720 

ATAGAAAATTCAGAGGATCACCTCCGTTTCGTACAAAGTGCATTGAGGGTCACATCAATT rFH4 . 3 

_ „ . „ , rFH2 .7 

ATAGAAAATTCAGAGGATCACCTCCGTTTCGTACAAAGTGCATTGAGGGTCACATCAATT rFHl . 8 
ATAGAAAATTCAGAGGATCACCTCCGTTTCGTACAAAGTGCATTGAGGGTCACATCAATT rFHl . 0 



3730 3740 3750 3760 3770 3780 

ATCCCACTTGTGTATMaatcgctatacaattattagtaaaccttatggatgagaaatgc rFH4 . 3 

rFH2.7 

ATCCCACTTGTGTATMaatcgctatacaattattagtaaaccttatggatgagaaatgc rFHl . 8 
ATCCCACTTGTGTASMaatcgctatacaattattagtaaaccttatggatgacactttg rFHl . 0 



3790 3800 3810 3820 3830 3840 

acatgtatattactaatacagtttgaatttacatttaaatattgtttagctcatttcctc rFH4 . 3 

rFH2 . 7 

acatgtatattactaatacagtttgaatttacatttaaatattgtttagctcatttcctc rFHl . 8 
tttagaaatgcacatgtatattactaatacagtttgaatttacatttgaaaaa rFHl . 0 



3850 3860 3870 38 8 0 3 8 90 3 900 

taataagtatataaactttttttatatggtggttaatcagtaactttacagactgttgcc rFH4 . 3 

-- rFH2.7 

taataagtatataaactttttttatatggtggttaatcagtaactttacagactgttgcc rFHl . 8 

- rFHl.O 



WO 98/23638 



PCT/GB97/03275 



- 12/15 - 



3910 3920 3930 3940 3950 3960 

acaaagcaa'gaacattacattcaaaactcctaatccaaatatgatatgtccaaggacaaa rFH4 . 3 

rFH2 . 7 

acaaagcaagaacattacattcaaaactcctaatccaaatatgatatgtccaaggacaaa rFHl . 8 

_ rFHl.O 



3970 3980 3990 4000 4010 4020 

ctatgtctaagcaagaaaataaatgttagttcttcaatgtctgtttttattcaggacctt rFH4 . 3 



rFH2 .7 



ctatgtctaagcaagaaaataaatgttagttcttcaatgrctgttttnattcaggacctt rFHl . 8 

, rFHl.O 



4030 4040 4050 4060 4070 4080 

tcagattttcttggataccttttgttaggttctgattcacagtgagtggaagacacactg rFH4 . 3 



rFH2 .7 



tcagattttcttggataccttttgttaggttctgattcacagtgagtggaagacacactg rFHl . 8 

_ rFHl . 0 



4090 4100 4110 4120 4130 4140 

actctgacttcaaattagtattacttgcaatacattaacaaccaaactatcataatatca rFH4 . 3 



rFH2 .7 



actctgacttcaaattagtattacttgcaatacattaacaaccaaactatcataatatca rFHl . 8 

rFHl.O 



4150 4160 4170 4180 4190 4200 

caaatgtatacagctaattactgtgtcctacctttgtatcaa taaagaaatctaagaaag rFH4 . 3 



4210 4220 4230 

ttcttgcttaaaaaaaaaaaaaaaaaaaa 

ttcttgcttaaaaaaaaaaaaaaaaaaaa 



rFH2 .7 



caaatgtatacagctaattactgtgtcctacctttgtatcaa taaagaaatctaagaaag rFHl . 8 

rFHl . 0 



rFH4.3 
rFH2.7 
rFHl. 8 
rFHl. 0 



